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Basic research with pigeons on behavioral momentum suggests that differential reinforcement of 
alternative behavior (DRA) can increase the resistance of target behavior to change. This finding 
suggests that clinical applications of DRA may inadvertently increase the persistence of target behavior 
even as it decreases its frequency. We conducted three coordinated experiments to test whether DRA 
has persistence-strengthening effects on clinically significant target behavior and then tested the 
effectiveness of a possible solution to this problem in both a nonhuman and clinical study. Experiment 
1 compared resistance to extinction following baseline rates of reinforcement versus higher DRA rates 
of reinforcement in a clinical study. Resistance to extinction was substantially greater following DRA. 
Experiment 2 tested a rat model of a possible solution to this problem. Training an alternative response 
in a context without reinforcement of the target response circumvented the persistence-strengthening 
effects of DRA. Experiment 3 translated the rat model into a novel clinical application of DRA. Training 
an alternative response with DRA in a separate context resulted in lower resistance to extinction than 
employing DRA in the context correlated with reinforcement of target behavior. The value of 
coordinated bidirectional translational research is discussed. 
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Applied Behavior Analysis has proven re- 
markably effective at reducing a wide range of 
unwanted target behaviors and replacing these 
behaviors with prosocial alternatives (Iwata et 
al., 1997). Differential reinforcement of alter- 
native behavior (DRA) has been one of the 
principal tools used to effect these improve- 
ments (Petscher, Rey & Bailey, 2009) . Despite 
the development and refinement of behavior- 
change technologies over the decades, Ap- 
plied Behavior Analysis has made limited 
progress in achieving long-term maintenance 
and generalization of treatment gains. Osnes 
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and Lieblein (2003) reported that most 
clinical studies fail to demonstrate the dura- 
bility and generality of treatment effects. One 
impediment to resolving the problem of 
treatment relapse, or maintenance and gener- 
alization failure, is the lack of understanding 
why target behaviors that have been eliminated 
in one context later resume in that or other 
contexts. However, developments in behavior- 
al momentum theory during the past two 
decades may provide a model for understand- 
ing treatment relapse and the conditions 
under which it is more and less likely to occur. 

Nevin’s formulation of behavioral momen- 
tum theory has shown there are two indepen- 
dent features of the discriminated operant: (a) 
its ongoing response rate, and (b) the resis- 
tance of this response rate to change when 
disrupted by operations such as extinction, 
satiation, alternative reinforcement, and dis- 
traction (Nevin, 1992; Nevin & Grace, 2000; 
Nevin, Tota, Torquato & Shull, 1990). Ongo- 
ing response rate is known to be a function of 
the contingency between responding and its 
consequences (i.e., response-reinforcer rela- 
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tions) . By contrast, the resistance of baseline 
response rate to change is a function of the 
total rate of reinforcement present in the 
context in which reinforced behavior occurs 
(i.e., context-reinforcer relations). These 
response-reinforcer and context-reinforcer 
functional relations have been shown to be 
robust and general across species and popula- 
tions including pigeons (Nevin et al., 1990), 
rats (Mauro & Mace, 1996), goldfish (Igaki & 
Sakagami, 2004), adults with developmental 
disabilities (Mace et al., 1990) and college 
students (Cohen, 1996). 

Demonstrations that resistance to change is 
a function of context-reinforcer relations have 
involved the use of alternative reinforcement 
in the form of time-contingent reinforcer 
deliveries and differential reinforcement of 
alternative behavior (DRA). Nevin et al. (1990, 
Experiment 2) illustrated the persistence- 
strengthening effects of DRA. Figure 1 dia- 
grams their baseline multiple concurrent 
schedule of reinforcement. Pigeons were 
presented with two color-illuminated response 
keys mounted on an operant panel. Pecks on 
the right key were designated the target 
behavior and pecks on the left key were 
defined as alternative behavior. Condition A 
represents a typical DRA arrangement in 
which comparatively high rate reinforcement 
is provided for an alternative response (45 
reinforcers/hr in this case) while the target 
behavior produces a lower rate of reinforce- 
ment (15 reinforcers/hr). With the alternative 
and target response keys illuminated green, 
the total reinforcement available in the green 
context, or the context-reinforcer relation, 
was 60 reinforcers/hr. Conditions B and C of 
the baseline schedule represented different 
rates of reinforcement for the target response 
without alternative reinforcement. Conditions 
A and C contained the same total number of 
context-reinforcer contingencies (60 reinforc- 
ers/hr) and Conditions A and B contained the 
same number of response-reinforcer contin- 
gencies for the target right key response (15 
reinforcers/hr). This arrangement permitted 
a direct test of whether resistance to change 
was a function of response-reinforcer or 
context-reinforcer contingencies. 

Nevin et al. (1990) measured resistance to 
change during the response disruptors of 
extinction and short and long periods of 
satiation. Although DRA in Condition A 
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Fig. 1. Diagram of the 3-component multiple concur- 
rent schedule of reinforcement from Experiment 2 of 
Nevin et al. (1990). Condition A (green key condition) 
models DRA where reinforcement on the alternative left 
key is three times greater than reinforcement of the target 
right key. Adapted from Nevin et al. (1990) and reprinted 
by permission. 

resulted in lower target response rates, resis- 
tance to change was comparable in Conditions 
A and C and greater than Condition B in all 
tests, thus supporting the hypothesis that 
resistance to change is a function of context- 
reinforcer contingencies. These findings dem- 
onstrated that although DRA can reduce 
occurrences of a target response it can also 
make the response more persistent when the 
DRA contingency is degraded in some fashion. 

This finding from basic research has con- 
siderable implications for clinical applications 
of DRA and possible reduction or avoidance of 
treatment relapse. One factor contributing to 
treatment relapse may be that DRA strength- 
ens the persistence of unwanted target behav- 
ior such that when there are compromises to 
DRA treatment integrity there is resurgence in 
target behavior. Basic research supports this 
speculation. For example, Podlesnik and Sha- 
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han (in press) arranged VI 120-s food rein- 
forcement for pigeons in a two-component 
multiple schedule that also included additional 
time-contingent reinforcer deliveries in one 
component. Extinction eliminated responding 
in both components, but when baseline rein- 
forcement was reinstated, responding recov- 
ered more quickly in the component with the 
added reinforcement. Clinical studies have 
reported similar recurrence of target behavior 
following its reduction via time-contingent 
reinforcer deliveries (Ahearn, Clark, Garden- 
ier, Chung, & Dube, 2003; DeLeon, Williams, 
Gregory & Hagopian, 2005). 

The purpose of the present investigation was 
threefold. First, we sought to establish whether 
DRA increases the resistance to extinction of 
target behavior in individuals with developmen- 
tal disabilities (Experiment 1). Second, given 
demonstration of the persistence-strengthen- 
ing effects of DRA, we developed an animal 
model of one possible solution to this clinical 
problem (Experiment 2). We chose an animal 
model to test the effectiveness of this solution 
in order to avoid placing both unwanted target 
behavior and prosocial alternative behavior in a 
clinical population on extinction without evi- 
dence that the proposed solution to persistence 
strengthening would be effective. Finally, in 
Experiment 3 the procedure used in the animal 
model was translated into a novel clinical 
intervention and its effectiveness at avoiding 
or reducing the persistence-strengthening ef- 
fects of DRA was evaluated. 

EXPERIMENT 1 
Method 

Participants and Setting 

Three children with developmental disabil- 
ities who were referred for treatment of severe 
behavior disorders participated in the study. 
Andy, a 7-year-old boy, was diagnosed with 
autism and severe mental retardation. He was 
able to follow simple, one-step instructions by 
using gestures and pointing to pictures. Andy 
attended a private day school for children with 
autism and pervasive developmental disorders. 
Tom and Jackie were inpatients in a children’s 
hospital unit. Tom, a 7-year-old boy, was 
diagnosed with Down’s Syndrome and Atten- 
tion Deficit Hyperactivity Disorder (ADHD). 
He was admitted to the unit for the treatment 
of food stealing, elopement and tantrums. 


Jackie, a 4-year-old girl, was diagnosed with 
severe mental retardation, microcephaly, and 
moderate hearing impairment who was hospi- 
talized for the treatment of aggression and 
self-injurious behavior. She was responsive to a 
few instructions delivered by manual signs. 

Sessions for Andy were conducted in his 
classroom in a partitioned instructional cubi- 
cal measuring 6 m by 3 m containing a table 
and two chairs. Other students and teachers 
present in the classroom participated in their 
regularly scheduled activities, but were not 
within view of the cubical during experimental 
sessions. Sessions for Tom and Jackie were 
conducted in a hospital room (3 m by 5 m) 
with two beds and a table and chairs or in a 
treatment room (3 m square) with a one-way 
observation window. 

Target Behaviors, Measurement, and 
Interobserver Agreement 

Andy’s target behavior was hair pulling, 
defined as an extension of the hand toward 
the hair of another person, including contact 
with the hair or attempts at contact. Tom’s 
target behavior was food stealing, defined as an 
extension of the hand toward another per- 
son’s plate of food during mealtimes. Food 
contact and attempts at food contact were 
recorded. The target behavior for Jackie was 
aggression, defined as hair pulling, scratching, 
hitting, and head butting. Alternative respons- 
es to be reinforced during the DRA phase of 
the study were appropriate toy play (touching or 
interacting with a toy for a minimum of 5 s) for 
Andy and Jackie, and appropriate requests for food 
(asking an adult for a snack item) for Tom. 

Two independent observers collected data 
on the target and alternative behaviors for all 3 
participants. Data on these behaviors were 
recorded with a count within a 10-s interval 
recording procedure using either computer 
software or paper and pencil. The second 
observer collected data on an average of 40% 
of the sessions across conditions and partici- 
pants (range = 31% to 45%). Occurrence 
agreement for the target behaviors, calculated 
on a point-by-point basis, averaged 93% for all 
3 participants (range = 89% to 100%). 

Procedures 

Pre-study functional analysis. Functional be- 
havioral assessment interviews (adapted from 
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Iwata, Wong, Riordan, Dorsey & Lau, 1982) 
and a functional analysis (Iwata, Dorsey, Slifer, 
Bauman and Richman (1982/1994) indicated 
that Andy’s hair pulling was maintained by 
adult attention and Jackie’s aggression was 
maintained by access to food. An interview and 
home visit suggested that Tom’s food stealing 
was maintained by access to food. 

Baseline (BL). For Andy, two teachers in the 
preschool served as experimenters. Andy sat at 
his desk in the cubical with several toys 
available. One teacher sat on the floor 1 m 
from Andy and read a magazine. Another 
teacher was positioned behind Andy on the 
other side of the partition and was able to 
observe his behavior. Contingent on hair 
pulling or attempts to pull the hair of the 
teacher present in the cubicle, the teacher 
positioned behind the partition entered the 
cubicle and provided a verbal reprimand on a 
variable ratio 3 (VR 3) schedule. Immediately 
following the reprimand the teacher returned 
to her position behind the partition. No 
demands or other forms of attention were 
provided during this condition. 

For Tom and Jackie, a therapist served as 
experimenter. Tom and the therapist sat at a 
table in a therapy room. Tom was given a 
hospital meal to eat. Approximately once every 
30 s during Tom’s meal, the therapist ate 
snack foods that Tom preferred from a plate 
positioned approximately 0.67 m from Tom. 
Food stealing resulted in access to bite-size 
pieces of food on a fixed ratio 1 (FR 1) 
schedule. Attempts at food stealing were not 
blocked and the therapist did not provide 
attention contingent on food stealing. 

Jackie and the therapist were in a hospital 
room and the therapist held a box of preferred 
snack food in her hand. Aggressive responses 
resulted in the therapist providing Jackie with 
one piece of the snack food on a variable 
interval 60-s schedule (VI 60 s) ; a limited hold 
was not employed. Interval values for all VI 
schedules in the experiment were calculated 
using the Fleshler-Hoffman (1962) VI genera- 
tor formulas and were signaled to the instruc- 
tor via audiotape with earphones. 

Differential reinforcement of alternative behav- 
ior (DRA). An alternative to target behavior was 
reinforced under DRA schedules that were 
substantially richer than those supporting the 
target behavior. These programmed values 
expressed as the percentage of the baseline 


rate of reinforcement for target behaviors were 
195% for Andy, 165% for Tom, and 185% for 
Jackie. 

Andy sat at his desk with several toys 
available as in baseline. Immediately before 
each session, the teacher who had been 
positioned behind the partition used a grad- 
uated prompt hierarchy (gesture, verbal, phys- 
ical prompts) to assist Andy to interact 
appropriately with the toys for approximately 
15 s. Following this assistance, the teacher 
went to her position behind the partition. 
During the session, the teacher present in the 
cubicle provided descriptive praise statements 
contingent on appropriate play on a VI 30-s 
schedule. The baseline procedures and con- 
tingencies for reinforcing hair pulling re- 
mained in effect to more closely replicate 
Experiment 2 of Nevin et al. (1990). 

For Tom, appropriate requests for food were 
reinforced on an FR 1 schedule. Contingent on 
an appropriate request for food, the therapist 
praised the request and handed one piece of 
preferred snack food to Tom. Food thefts 
continued to be reinforced as in baseline. 

For Jackie, social reinforcement followed 
alternative behavior even though her target 
behavior, aggression, was maintained by access 
to food. We used this arrangement to repre- 
sent the use of DRA in cases where the 
reinforcer maintaining the target behavior is 
unknown or maintained by automatic rein- 
forcement. Five seconds of toy play was 
reinforced on a VR 2 schedule and aggression 
continued to produce access to food on a VI 
60-s schedule as in baseline. 

Extinction (EXT). For all 3 participants, 
response blocking was used to effect an 
extinction operation. Circumstances replicat- 
ed those of baseline, with two exceptions. First, 
attempts to pull hair, steal food, or engage in 
aggression were blocked by the teacher or 
therapist in the room. Blocking consisted of 
the adult placing his/her hand or arm in front 
of the participants’ hand each time the 
participant attempted to engage in the target 
behavior. Second, the reinforcer maintaining 
each participant’s target behavior was with- 
held. No other interactions took place be- 
tween the experimenters and participants. 

Experimental Design 

The experimental conditions described 
above were presented to participants in a 
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sequence of phases that arranged extinction to 
alternately follow baseline and DRA reinforce- 
ment in a counterbalanced order across 
participants. The sequence order for Andy 
was BL-DRA-EXT-BL-EXT. For Tom and 
Jackie, the order was BL-EXT-BL-DRA-EXT. 
The duration of each phase was determined by 
evidence of visual stability in the data during 
baseline and DRA and an apparent extinction 
process during extinction. 

Results and Discussion 

Figure 2 presents target response rates for 
the 3 participants across all phases of the 
study. In the initial baseline, Andy’s hair pulls 
occurred at high rates (M = 186.7/hr; range = 
172 to 209; top panel). The addition of the 
DRA intervention resulted in a sharp decrease 
in hair pulling after 2 sessions that generally 
continued over 19 sessions ( M = 65.0/hr; 
range = 29 to 171). Implementation of 
extinction via response blocking in the third 
phase resulted in a large increase in hair 
pulling during the first extinction session 
(266/hr) that decreased steadily to near-zero 
levels after 36 sessions of extinction (M = 
51.7/hr; range = 7.8 to 266). The return to 
baseline conditions in the fourth phase result- 
ed in an immediate resumption of high and 
steady rates of hair pulling, although to a lower 
level than the initial baseline (M = 103.8/hr; 
range = 76 to 175). The second implementa- 
tion of extinction in the subsequent phase 
resulted in an immediate reduction in hair 
pulling without an extinction burst. Flair 
pulling decreased to near-zero levels after 
three sessions and remained at those levels 
for the remaining eight sessions of extinction 
(M = 14.8/hr; range = 7.8 to 46.9). Tom’s 
first baseline phase resulted in variable but 
stable levels of food thefts (M = 74.2/hr; 
range = 20 to 119). The first session of the 
initial extinction phase produced a burst of 
food thefts to 371 per hour followed by an 
immediate decrease in food thefts at the third 
session to near-zero levels that remained low 
during the next 10 sessions (M = 52.6/hr; 
range = 1 to 371). The return to baseline 
recaptured high rates of food thefts (M = 
106.3/hr; range = 66.7 to 166.6) and the 
subsequent DRA intervention produced sub- 
stantial reductions in food stealing over the 
course of 19 sessions, although food thefts 
resurged temporarily on the 17 th session of 


DRA (M = 24.7/hr; range = 1 to 150). Tom’s 
second extinction phase did not result in an 
initial burst but food thefts continued inter- 
mittently in the baseline range over the first 22 
sessions and took 30 sessions to consistently 
occur at near-zero rates. Finally, Jackie’s initial 
baseline rates of aggression were highly vari- 
able but stable (M = 285/hr; range = 45.5 to 
636.4). Blocking aggression produced an 
immediate reduction in the target behavior 
during the first extinction phase that re- 
mained consistently at zero or near-zero levels 
after 8 sessions. The second baseline phase saw 
a resumption of high and variable rates of 
aggression ( M = 427.02/hr; range, = 72.7 to 
854.6). Although the DRA intervention that 
followed the second baseline reduced rates of 
aggression, treatment effects were not as large 
as those observed for Andy and Tom (M = 
257.0/hr; range = 62 to 632). The extinction 
phase that followed DRA required 34 sessions 
before aggression was reduced consistently to 
zero or near-zero levels (M = 135.3/hr; range 
= 0 to 895.9) . Rates of the alternative behavior 
during the DRA intervention varied widely 
across participants. The occurrence of appro- 
priate toy play was 383/hr and 17/hr for Andy 
and Jackie, respectively. Mean appropriate 
requests for food were 177/hr for Tom. 

The comparison of primary interest in 
this experiment is the relative resistance to 
extinction following baseline versus DRA 
rates of reinforcement. Figure 3 superimposes 
target behavior response rates during the 
two extinction phases expressed as the pro- 
portion of the preceding baseline response 
rate for all 3 participants. Reductions in the 
target behaviors during extinction took three 
times or more longer for all 3 participants. 
The number of extinction sessions following 
DRA versus baseline was 36 versus 10 for Andy 
(360% more following DRA); 36 versus 12 for 
Tom (300%), and 37 versus 12 for Jackie 
(308%). 

The results of Experiment 1 demonstrated 
the persistence-strengthening effects of DRA 
on target behavior. Although DRA reduced 
occurrences of target behaviors for all 3 
participants, DRA was associated with greater 
resistance to change during the adjacent 
extinction phase compared to extinction 
following baseline rates of reinforcement. 
These results are consistent with and predicted 
by Experiment 2 of Nevin et al. (1990). 
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Fig. 2. Experiment 1: Rates of target behaviors during baseline, DRA and extinction conditions across participants. 


EXPERIMENT 2 

Experiment 1 illustrated the capacity of DRA 
to increase resistance to extinction of problem 
behavior. Experiment 2 evaluated a rat model 
of a possible solution to this untoward side 
effect of DRA. According to behavioral mo- 
mentum theory, DRA increases persistence by 
adding reinforcement to the context in which 
the target behavior has a history of reinforce- 
ment. As Nevin et al. (1990, Experiment 2) 
demonstrated, reinforcing alternative behavior 
(the left key in condition A shown in Figure 1) 


increased the persistence of the target behav- 
ior during extinction tests (the right key in 
condition A, Figure 1). In our rat model, an 
alternative response was first trained in a 
context separate from the one in which the 
target behavior occurred. In the extinction 
test, the discriminative stimuli correlated with 
the alternative response were introduced to 
the context correlated with the target behav- 
ior. This practice is contrary to the widespread 
use of DRA in the context in which the target 
behavior occurs (e.g., Carr & Durand, 1985; 
Fisher et al., 1993). We hypothesized that the 
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Fig. 3. Experiment 1: Comparison of rates of target behaviors during extinction following baseline reinforcement 
and DRA reinforcement. Data are expressed as the proportion of the preceding baseline response rate. 


separate context training procedure would 
result in less resistance to extinction than 
DRA implemented in the target behavior 
context. 

We chose to test our hypothesis with a rat 
model for two reasons. First, it permitted 
precise control of the subjects’ historical 
experience with specific discriminative stimu- 
li— reinforcer delivery pairings, which remains 
largely unknown in human participants. Sec- 
ond, by first testing this hypothesis with rats, 


we did not have to place alternative prosocial 
behavior on extinction with human clinical 
participants without having a basis for expect- 
ing the hypothesis would be supported. 

Method 

Subjects 

Four experimentally-naive male Charles 
River CD rats (MV 54 to MV 57) were used 
in this experiment. The animals were between 
the ages of 2 and 4 months at the beginning of 
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the experiment and were housed individually 
throughout the experiment. The rats were 
maintained at 80% of their free-feeding weight 
through supplemental feeding. Water was 
continuously available in each animal’s home 
cage. 

Apparatus 

Experimental sessions were conducted in 
four BRS/LVE two-lever operant chambers for 
rats (Model RTC-002). The left and right levers 
protruded 2.2 cm into the chamber from the 
right wall (operant panel) and each lever 
required a force of 0.15 N to operate. A bank 
of three jeweled lights was located 4.3 cm 
above each lever. Only the center light of each 
bank of lights was used as the visual stimulus 
and provided either flickering or steady 
illumination. An automatic food dispenser 
delivered 45-g Noyes pellets to a scoop-type 
chute centered on the operant panel 1.2 cm 
above the grid floor. 

The experimental chambers were placed in 
MED Associates sound and light attenuating 
enclosures (Model ENV-015M). Additional 
sound attenuation was provided by a ventila- 
tion fan and a Grason-Stadler noise generator 
(Model 901B) that produced masking noise 
within 1 m of the enclosures. A computer 
using MED Associates notation was used to 
program stimuli and record responses through 
a MED Associates interface. 

Procedure 

Preliminary training. Responding on the left 
and right levers was separately established 
using differential reinforcement of successive 
approximations. This was followed by contin- 
uous reinforcement (CRF) and fixed-ratio 
(FR) training in which the reinforcement 
schedule was gradually thinned to FR15 across 
3 to 9 days. The order in which training 
occurred on the left and right levers was 
counterbalanced across rats. Subsequently, 
a series of concurrent VI VI schedules 
was introduced and the schedules were grad- 
ually thinned from VI 10 s to VI 30 s across 
10 days. 

Baseline. Baseline sessions consisted of three 
cycles of a three-component multiple concur- 
rent schedule of reinforcement. Each compo- 
nent was 6 min in duration and was presented 
quasirandomly without replacement within a 


session. Components were separated by a 30-s 
intercomponent period without any experi- 
mentally-programmed events occurring. Thus, 
each cycle lasted 19.5 min and the total session 
duration was 58.5 min. The stimulus arrange- 
ments for the three components during 
baseline are illustrated in the top panel of 
Figure 4. Component 1 was accompanied by a 
darkened left light and a right flicker light at a 
rate of 1 flick per second (1 f/s). Both left and 
right lights flickered at 5 f/s during Compo- 
nent 2. During Component 3, the right light 
was darkened and the left light was on 
continuously. Baseline sessions were conduct- 
ed daily. The experimental design involved 
several baseline sessions followed by a single 
extinction session. The number of baseline 
sessions for MV 54 to MV 57 was 41, 39, 20, and 
39, respectively. 

The arrangement of concurrent reinforce- 
ment for each of the three baseline compo- 
nents is also illustrated in the top panel of 
Figure 4. A single VT 30-s schedule operated 
continuously throughout the session and 
arranged reinforcers for either the left or 
right lever press through a probability gate (p) . 
In Component 1, extinction was arranged on 
the left lever (p = 0) and the programmed 
reinforcement rate per hr on the right lever 
was 24 food pellets (p = .2). During Compo- 
nent 2, the programmed rate of reinforcement 
was 96 (p = .8) food pellets per hr on the left 
lever and 24 (p = .2) on the right lever. In 
Component 3, 96 (p = .8) food pellets per hr 
were programmed on the left lever and 
extinction was arranged on the right lever (p 
= 0). Thus, the combined reinforcement 
arranged in Components 1 and 3 equaled 
the rate of reinforcement provided in Com- 
ponent 2. Reinforcers programmed but not 
earned when each component ended were 
cancelled. For each individual animal, the 
response rate per hr during baseline was 
judged as stable by visual inspection before 
the extinction test was conducted. Additional- 
ly, performance was deemed stable when it 
met both of the following criteria: (a) response 
rates did not exceed or fall below ± 15 
responses/hr on the right and ± 10 re- 

sponse/hr on the left lever in all three 
components for five consecutive sessions, and 
(b) stable ordinal relationships across the 
three components on the left and right levers 
for five consecutive sessions. 
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Fig. 4. Experiment 2: Diagram of the discriminative 
stimuli and reinforcement rates in the three-component 
multiple concurrent schedule of the rat model. The top 
panel depicts the baseline arrangement and the bottom 
panel represents arrangements during the extinction tests. 

Extinction test. A single extinction test session 
was conducted after baseline performance was 
judged to be stable. All food reinforcement 
was discontinued during the extinction test by 
rendering the pellet dispenser inoperative. A 
multiple concurrent schedule similar to that 
used in baseline was employed during the 


extinction test. To allow responding in all 
three components to contact extinction dur- 
ing the beginning, middle and end of the test, 
the components were presented quasiran- 
dontly within blocks of three components 
and data were recorded at the end of each 
19.5 min block. Components 1, 2, and 3 were 
presented randomly once per block. The 
extinction session was terminated at the 
completion of the block in which there had 
been no response on the right lever for each of 
the three components. Visual stimuli present- 
ed during the extinction test are displayed in 
the lower panel of Figure 4. Components 1 
and 2 were signaled by the same visual stimuli 
used in baseline. A stimulus compound was 
introduced in a third component that com- 
bined the stimuli correlated with Components 
1 and 3 during baseline. This stimulus 
compound modeled the clinical situation in 
which DRA is used to train an alternative 
behavior in a separate context (baseline 
Component 3) and the discriminative stimuli 
corresponding to DRA training are introduced 
into the context in which the target problem 
behavior has been reinforced (baseline Com- 
ponent 1). Thus, the signaled reinforcement 
history in Component 2 and Component 3 
(the Stimulus Compound of Baseline Compo- 
nents 1 and 3) were equal during extinction 
(120/hr), and were greater than the signaled 
reinforcement history correlated with Compo- 
nent 1 (24/hr). 

Results and Discussion 

The mean terminal absolute response and 
reinforcement rates for the left and right 
levers and the relative response and reinforce- 
ment rates for the right lever (target response) 
are displayed in Table 1. Obtained rates of 
reinforcement for both response alternatives 
(left and right levers) across all three compo- 
nents approximated the programmed rates of 
reinforcement for each alternative, suggesting 
experimental control over the absolute and 
relative rates of reinforcement. For all subjects, 
within-component response rates on the left 
lever were comparable for Components 2 and 
3 that programmed 96 pellets/hr and were 
lowest in Component 1 (0 pellets/hr). Re- 
sponse rates on the right lever were highest in 
Component 1 (24 pellets/hr with no alterna- 
tive reinforcement) and lowest in Component 
3 (0 pellets/hr) for all subjects. An examina- 
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Table 1 

Absolute and relative response and reinforcement rates for the target (right) and alternative 
responses (left) during Baseline in Experiment 2. Each value is the mean of the final five sessions. 


Rat 

Response rate (min) 

Reinforcement rate (hr) 

Relative rates 

Left 

Right 

Left 

Right 

Total 

Resp. 

Reinf. 




Component 1 




MV 54 

4.89 

19.30 

0.00 

24.67 

24.67 

0.81 

1.00 

MV 55 

6.09 

27.79 

0.00 

18.67 

18.67 

0.84 

1.00 

MV 56 

6.50 

13.29 

0.00 

18.00 

18.00 

0.68 

1.00 

MV 57 

4.82 

21.52 

0.00 

26.00 

26.00 

0.80 

1.00 




Component 2 




MV 54 

40.43 

12.30 

94.67 

18.00 

112.67 

0.23 

0.16 

MV 55 

48.14 

16.27 

94.00 

18.67 

112.67 

0.25 

0.17 

MV 56 

37.24 

4.52 

91.33 

24.67 

116.00 

0.11 

0.22 

MV 57 

34.71 

8.98 

90.00 

14.00 

104.00 

0.19 

0.17 




Component 3 




MV 54 

40.84 

6.80 

92.67 

0.00 

92.67 

0.14 

0.00 

MV 55 

60.45 

3.27 

95.33 

0.00 

95.33 

0.05 

0.00 

MV 56 

42.52 

2.81 

96.67 

0.00 

96.67 

0.06 

0.00 

MV 57 

37.41 

3.36 

88.67 

0.00 

88.67 

0.07 

0.00 


tion of the absolute and relative response rates 
in Components 1 and 2 (both arranging 24 
pellets/hr) shows that for all subjects target 
responding was lower in the component that 
arranged response-contingent alternative rein- 
forcement (i.e., DRA in Component 2). 

The log proportion of baseline responding 
on the right (target) lever across successive 
blocks of extinction for all 4 rats is presented 
in Figure 5. For all 4 rats, responding took the 
shortest amount of time to extinguish in 
Component 1 where the visual stimuli (1 f/s) 
were correlated with the least amount of 
programmed reinforcement (24/hr) and no 
alternative reinforcement. Conversely, the 
proportion of baseline target response rates 
for all subjects took longest to extinguish in 
Component 2 in which the visual stimuli 
correlated with the overall programmed rates 
of reinforcement (120/hr) were the same for 
the target and alternative responses (5 f/s). 
Target responding in Component 3 that 
compounded the stimuli signaling baseline 
reinforcement for Component 1 (1 f/s) and 
Component 3 (constant light) was markedly 
lower than that in Component 2, even though 
the total programmed reinforcement correlat- 
ed with the discriminative stimuli during 
extinction was equal (120/hr) (for MV-54, 
MV-55, MV-56, and MV-57, respectively, mean 
proportion of baseline log response rate on 
the target lever = .64, .54, .76, and .52 for 
Component 2 and .37, .33, .33, and .31 for 
Compound Cl and C3). Of relevance to the 


main experimental hypothesis, responding 
during extinction was comparable during 
Component 1 and the Compound Stimulus 
for MV 54 and MV 56, and lower in the 
Compound Stimulus than Component 1 for 
MV 55. 

The purpose of Experiment 2 was to model 
a possible solution to the problem of DRA 
increasing the resistance to extinction of 
unwanted target behavior. Baseline Compo- 
nent 1 represented reinforcement of the 
target response only, Component 2 simulated 
DRA reinforcement of alternative behavior in 
the same context in which the target response 
is also reinforced, and Component 3 repre- 
sented establishing an alternative response 
with high rate reinforcement in a context 
separate from the one correlated with rein- 
forcement of the target behavior. When the 
discriminative stimuli in baseline Component 
3 were placed in compound with the stimuli in 
Component 1, increased resistance to extinc- 
tion was avoided in 3 of 4 subjects. For the 4th 
subject, responding during extinction was still 
less than during Component 2. 

EXPERIMENT 3 

Experiment 2 modeled a method of imple- 
menting DRA that attenuated the persistence- 
strengthening effects of DRA. The model 
suggests that a homologous clinical interven- 
tion may effectively avoid the persistence- 
strengthening effects of DRA and, hence, 
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Component 1 



Blocks of Extinction Blocks of Extinction 


Fig. 5. Experiment 2: Log proportion of baseline response rates during six blocks of a single extinction session for 
Components 1, 2 and Compound of Baseline Component 1 and Component 3 stimuli across 4 rats. 


justify a clinical trial in which both the target 
and alternative responses are placed on 
extinction in a clinical population. As a 
preliminary test of this proposition, in Exper- 
iment 3 we taught a communication response 
using functional communication training (a 
form of DRA) to children with severe disrup- 
tive behavior in a context in which disruptive 
behavior had no history of reinforcement. As 
in Experiment 2, we then compared target 
responding during extinction in the separate 
context DRA to the context in which DRA was 
implemented in the same context in which 


target behaviors had a history of reinforce- 
ment. 

Method 

Participants and Setting 

The participants were 2 males with develop- 
mental disabilities who were admitted to an 
inpatient hospital program for the treatment 
of severe behavior disorders. Mickey was 
11 years old and diagnosed with moderate 
mental retardation. He spoke in simple sen- 
tences and engaged in severe disruptive 
behavior during performance of daily living 
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skills. Paul was 21 years old, and diagnosed 
with severe mental retardation. He was non- 
verbal and engaged in severe disruptive 
behavior during vocational tasks. Sessions were 
conducted in three different rooms on an 
inpatient hospital unit, each containing a table 
and task materials. 

Target Behaviors, Measurement, and 
Interobserver Agreement 

Disruptive behavior was the target for both 
participants. Mickey’s disruptive behavior was 
defined as throwing task items, pounding on 
the table, shouting and slapping instructors. 
Paul’s disruptive behavior was defined as 
throwing or destroying task materials, pound- 
ing on the table, kicking the underside of the 
table with his knees, and slapping instructors. 
Requests were defined as speaking the phrase 
‘Break please’ (Mickey) or touching a 7.6 cm 
by 20.3 cm card with the printed word ‘Stop’ 
on it (Paul). 

Data on target behaviors, requests for 
breaks, and reinforcer deliveries were collect- 
ed using a continuous count within a 10-s 
interval recording procedure during 10-min 
sessions. Interobserver agreement was calcu- 
lated on a minimum of 37% of the sessions. 
Mean occurrence agreement was 87% (range 
= 61% to 100%) across participants. 

Pre-Study Functional Analysis 

Prior to the study, functional analysis pro- 
cedures described for Andy and Jackie of 
Experiment 1 indicated that both participants’ 
disruptive behavior was maintained primarily 
by negative reinforcement in the form of 
escape from demands. 

Baseline Multiple Concurrent Schedule 

Baseline conditions employed the multiple 
concurrent schedule of reinforcement depict- 
ed in the top portion of Table 2. Components 
were presented to participants in random 
order without replacement with 5 to 10 min 
between components, during which the ex- 
perimenters did not interact with the partici- 
pants. One or two sets of three components 
were presented per day, 5 days per week. 
Therapists wore different colored hospital 
gowns in the three schedule components. 
Gown colors varied by component as noted 
in Table 2. Each component was also conduct- 


ed in a different room to promote component 
discrimination. In order to control for the 
presence of two instructors in Component 2, 
two instructors were present during all three 
components for Mickey and both wore the 
same colored hospital gowns. 

Component 1. This component represented 
reinforcement of target behavior without 
reinforcement of alternative prosocial behav- 
ior. Different task demands were presented to 
participants. Mickey was asked to sort and fold 
shirts and pants and Paul was instructed to 
wash tables and windows using paper towels 
and a spray bottle. Two instructors stood on 
either side of Mickey. The instructor on the 
right side presented task demands while the 
other instructor did not interact with the 
participant. One instructor was present for 
Paul. Sessions began with the instructor 
placing the task materials on the table and 
saying, “Do your work, please.” Following this 
initial instruction, instructions to perform 
relevant task steps were presented using a 
graduated three-step prompt hierarchy (verbal 
prompt, modeling, physical prompt) with 
prompts separated by 5 s. Contingent on 
disruptive behavior, the instructor said, “Okay, 
take a break,” and moved at least 2 m away 
from the participant. No programmed inter- 
action took place during the break. Escape 
from demands was arranged on a VI 75-s 
schedule of negative reinforcement for which 
interval values were calculated using the 
procedure of Fleshier and Hoffman (1962). 
The completion of each interval was signaled 
to the instructor via earphones. 

Component 2. This component represented 
negative reinforcement of alternative behavior 
in the same context in which disruptive 
behavior was negatively reinforced. Procedures 
in Component 2 were identical to those of 
Component 1 except that an instructor pro- 
vided prompts to request a break from 
instruction on a fixed interval 20-s (FI 20-s) 
schedule. Prompts were “Ask if you want a 
break” for Mickey and “Point to the card if 
you want a break” for Paul. Contingent on 
requests for breaks, the instructor provided a 
break as described in Component 1. If the 
participant requested a break prior to the 
elapse of the FI 20-s interval, the instructor 
said, “Okay, just a little while longer,” and 
continued instruction until the reinforcement 
interval elapsed. Disruptive behavior contin- 
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Table 2 

Discriminative stimuli (gown color and room) and programmed rates of reinforcement per hour 


during the baseline 
Experiment 3. 

multiple concurrent schedule 

of reinforcement and extinction tests 

in 

Component 

Discriminative Stimuli 

Rft/hr Requests 

Rft/hr Disruption 

Total BL Rft 

Baseline 

i 

Yellow (Mickey) 
Blue (Paul) 
Room 1 

none 

48 

48 

2 

Blue (Mickey) 
White (Paul) 
Room 2 

180 

48 

228 

3 

White (Mickey) 
Yellow (Paul) 
Room 3 

180 

none 

180 

Extinction Test 

i 

Yellow (Mickey) 
Blue (Paul) 
Room 1 

EXT 

EXT 

48 

2 

Blue (Mickey) 
White (Paul) 
Room 2 

EXT 

EXT 

228 

Compound BL Cl + C3 

Yellow/White (Mickey) 
Blue/Yellow (Paul) 
Room 1 

EXT 

EXT 

228 


ued to be reinforced on the VI 75-s schedule 
described in Component 1. For Mickey, the 
instructor to his left provided instruction and 
reinforcement of requests for a break. The 
instructor to Mickey’s right reinforced disrup- 
tive behavior. For Paul, the single instructor 
reinforced both requests and disruptive behav- 
ior on their respective schedules. 

Component 3. This component represented 
reinforcement of alternative behavior in a 
context that provided no reinforcement of 
disruptive behavior. The task instruction de- 
scribed in Component 1 and reinforcement of 
requests for breaks described in Component 2 
were replicated in the third component. 
However, the instructor did not provide breaks 
from instruction contingent on disruptive 
behavior. 

Extinction Test 

Instruction of each participant’s daily living 
task continued as in baseline during all three 
components. However, all reinforcement of 
target disruptive behavior and requests was 
discontinued in order to test the hypothesis 
that reinforcement of prosocial alternative 
behavior in a separate context from one that 
also reinforced disruptive behavior could 


reduce or avoid the persistence-strengthening 
effects of DRA. The discriminative stimuli and 
arranged baseline rates of reinforcement are 
presented in the lower portion of Table 2. The 
discriminative stimuli for Components 1 and 2 
were identical to those in baseline. The 
discriminative stimuli in a third component 
during extinction comprised a compound of 
the baseline stimuli for Components 1 and 3. 
This represented the clinical situation in 
which a prosocial alternative behavior, such 
as communication, is first established in a 
context with no history of reinforcement of 
unwanted target behavior prior to introducing 
the discriminative stimuli correlated with 
alternative behavior into the context that has 
a history of reinforcement for target behavior 
(see lower portion of Table 2). 

Results and Discussion 

Figure 6 presents baseline data from the 
three components of the multiple concurrent 
schedule. For both participants, the highest 
rates of disruptive behavior occurred in 
Component 1 , in which only disruptive behav- 
ior was reinforced (Mickey, M = 50.4 disrup- 
tions /hr, range = 30.0 to 93.5; Paul, M = 
135.7, range = 35.0 to 292.0). Rates of 
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»C 1 Blue Requests — No rft Disruption --- VI 75 s 



Fig. 6. Experiment 3: Disruptions per hour during baseline sessions of the multiple concurrent schedule of 
reinforcement. Instructors wore different colored hospital gowns and conducted sessions in different rooms to 
differentiate the schedule components. 


disruptive behavior were substantially lower in 
Component 2, in which requests were rein- 
forced 3.75 times more often than disruptive 
behavior (Mickey, M = 22.9 disruptions/hr, 
range = 5.5 to 48.3; Paul, M = 34.06, range = 
5.0 to 173.0). The lowest rates of disruptive 
behavior occurred in Component 3, in which 
requests only were reinforced (Mickey, no 


disruptive behavior; Paul, M = 15.2 disrup- 
tions/hr, range = 0 to 75.0). 

Resistance to extinction expressed as propor- 
tion of baseline response rate was greatest in 
Component 2 for both participants (see Fig- 
ure 7) where the discriminative stimuli were 
identical to those in baseline (see lower portion 
of Table 2). Both participants showed an 
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Fig. 7. Experiment 3: Rates of disruptive behavior during extinction sessions expressed as proportion of baseline 
response rates for Components 1 and 2 and a Compound of Baseline Component 1 and Component 3 stimuli. 


extinction burst in Component 2. Mickey’s 
disruptive behavior reduced to low rates after 
four extinction sessions; however, Paul contin- 
ued to engage in high rates of disruptive 
behavior in Component 2 during nine sessions 
of extinction. Component 2 sessions were 
discontinued for Paul prior to effecting an 
extinction process for ethical reasons (his 
disruptive behavior had reduced to below 
baseline rates in Components 1 and the 


Compound Stimulus Component). Resistance 
to extinction was considerably lower in Com- 
ponent 1 where the discriminative stimuli were 
the same as baseline and baseline rates of 
reinforcement were approximately one-quarter 
of those in Component 2. Of primary interest to 
the experimental hypothesis of the study are 
the data on resistance to extinction during the 
Compound Stimulus of baseline Components 1 
and 3. This compound stimulus was correlated 
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with the same reinforcement rates as Compo- 
nent 2. However, rates of disruptive behavior 
during extinction were much lower than in 
Component 2 and were comparable to those in 
Component 1 for both participants where the 
added reinforcement of DRA was absent. These 
results replicate the rat model of Experiment 2 
by showing that the persistence-strengthening 
effects of DRA can be reduced or avoided by 
using DRA in a context separate from that 
correlated with a history of reinforcement of 
unwanted target behavior. 

GENERAL DISCUSSION 

We conducted two clinical studies and one 
basic research experiment that were coordi- 
nated to demonstrate the persistence-strength- 
ening effects of DRA and develop a possible 
solution to the problem. Experiment 1 showed 
that clinical applications of DRA can increase 
the resistance of unwanted target behavior to 
extinction. The duration and response levels 
of the extinction processes were substantially 
greater following DRA reinforcement than 
after baseline reinforcement of target behav- 
ior. In Experiment 2, we devised a DRA 
treatment for target behavior in a rat model 
that was aimed at avoiding or reducing the 
persistence-strengthening effects of DRA. The 
source of the problem is that typical applica- 
tions of DRA involve adding reinforcement to 
the context in which target behavior is, or has 
been, reinforced. When a new alternative 
response such as communication is established 
with DRA, it is generally reinforced at a high 
rate both to teach the response and to 
compete with reinforcement for target behav- 
ior. For example, Fisher et al. (1993) used 
functional communication training (FCT) to 
reduce unwanted target behavior with and 
without extinction of problem behavior. Com- 
munication responses were reinforced on a FR 
1 schedule and this resulted in rates of 
communication ranging from 4.5 per min to 
7.0 per min for one study participant. To avoid 
adding reinforcement to the context in which 
the modeled target behavior occurs, we rein- 
forced an alternative response in a different 
context in our rat model of DRA treatment 
(Component 3). During the extinction tests, 
we arranged a stimulus compound composed 
of the stimuli correlated with reinforcement of 
the target behavior (Component 1) and those 


correlated with reinforcement of alternative 
behavior in a context without reinforcement 
of the target behavior (Component 3). Al- 
though the total reinforcement in the com- 
pound stimulus equaled that of the compo- 
nent that included DRA and reinforcement of 
target behavior (Component 2), resistance to 
extinction was much lower in the component 
with the compound stimulus. We translated 
these findings into a novel approach to DRA 
treatment in a clinical study in Experiment 3. 
Consistent with our rat model, using FCT to 
establish communication in a context without 
reinforcement of target behavior circumvent- 
ed the persistence-strengthening effects of 
DRA during extinction tests. 

These findings have implications for resolv- 
ing some longstanding limitations to Applied 
Behavior Analysis interventions. Long-term 
maintenance and generalization of treatment 
gains are often not reported or achieved 
(Osnes & Leiblien, 2003; Stokes & Osnes, 
1989). One possible reason for this is that 
lapses in treatment integrity can result in 
recurrence of problem behavior. Lapses in 
treatment integrity can occur when rates of 
DRA reinforcement drop precipitously or 
diminish over time resulting in a temporary 
extinction operation or an increase in relative 
reinforcement for target behavior. Burst re- 
sponding with the initiation of extinction is 
widely reported. For example, Lerman, Iwata 
and Wallace (1999) reported extinction bursts 
in 62% of clinical cases in which extinction of 
self-injurious behavior followed reinforce- 
ment-based treatments. Partial compromises 
to treatment integrity have also been related to 
the resumption of problem behavior. DiGen- 
naro, Martens and Kleinmann (2007) identi- 
fied 13 procedural steps for teachers to follow 
to implement a DRA intervention. They found 
significant inverse correlations between the 
percentage of accurately implemented treat- 
ment procedures and occurrences of target 
behavior for three-quarters of their partici- 
pants, ranging from -.45 to -.78. Vollmer, 
Roane, Ringdahl and Marcus (1999) systemat- 
ically varied DRA treatment integrity for 3 
children with developmental disabilities. For 2 
participants, varied levels of treatment integri- 
ty were 0/100 (baseline), 25/75, 50/50, 75/25, 
and 100/0 (full treatment). In their notation, 
75/25 denotes 75% of occurrences of proso- 
cial alternative behaviors were reinforced and 
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25% of target behaviors were reinforced. 
Resumption of target behavior was observed 
when treatment integrity dropped to or below 
50/50. 

Basic research on resurgence predicts that 
such lapses in treatment integrity would result 
in recurrence of target behavior after success- 
ful treatment (Podlesnik, Jimenez-Gomez, & 
Shahan, 2006; Podlesnik & Shahan, in press). 
In the resurgence paradigm, after a target 
response is established, it is then extinguished 
while also reinforcing an alternative response 
(analogous to DRA plus extinction) . If rein- 
forcement of the alternative response is 
subsequently diminished, recovery of the 
target response is observed even though 
reinforcement of the target response does 
not resume. 

A second factor known to contribute to the 
maintenance and generalization failure is the 
reintroduction of stimuli that were previously 
correlated with pretreatment reinforcement of 
target behavior (Osnes & Lieblein, 2003; 
Stokes & Baer, 1977; Stokes & Osnes, 1989). 
Transferring treatment effects from behavior 
analysts to parents or educational personnel is 
impeded by the phenomena of reinstatement 
and renewal. In the reinstatement paradigm, a 
response reinforced in baseline is then elim- 
inated by extinction or alternative reinforce- 
ment. Subsequent time-contingent delivery of 
the reinforcer that maintained baseline re- 
sponding then results in recovery of the target 
response despite ongoing extinction (Shaham, 
Shalev, Lu, de Wit, & Stewart, 2003). DeLeon 
et ah (2005) demonstrated reinstatement 
effects in a clinical population. After extinc- 
tion had reduced target behavior to low levels, 
introduction of reinforcers on a time-contin- 
gent schedule increased rates of target behav- 
ior despite ongoing extinction. The clinical 
problem this poses is that parents and educa- 
tional personnel will almost certainly deliver 
reinforcers noncontingently that previously 
maintained target behavior; thus, reinstate- 
ment effects should be expected. 

Renewal occurs when a response is elimi- 
nated in a context different from the one 
correlated with baseline reinforcement as 
might occur when an individual receives 
treatment in a clinic for unwanted behavior 
that occurs at home. After successful reduction 
of target responding, recurrence of the target 
response is observed when the baseline con- 


text is reintroduced. Poldesnik & Shahan (in 
press) demonstrated renewal effects with 
homing pigeons. A baseline multiple schedule 
of reinforcement (mult VI 120 s VI 120 s VT 
20 s) was presented with a steady houselight 
illuminated, followed by extinction during a 
flashing houselight. After responding was 
eliminated in the presence of the flashing 
light, extinction was then applied with the 
baseline steady houselight in an A-B-A arrange- 
ment. Recovery was observed in both compo- 
nents of the second A phase but was greater in 
the schedule component with the added time- 
contingent reinforcers. 

Results from Experiment 1 demonstrate that 
DRA can strengthen the persistence of un- 
wanted target behavior. Basic research on 
reinstatement, resurgence and renewal further 
suggest the conditions under which these 
persistence-strengthening effects are likely to 
occur. Resurgence effects may result from 
lapses in treatment integrity. Reinstatement 
and renewal effects may interfere with efforts 
to transfer treatment gains first obtained in a 
treatment setting to a natural setting with a 
history of reinforcement of target behavior. 
The growing number of clinical demonstra- 
tions that alternative reinforcement can 
strengthen persistence (Ahearn et ah, 2003; 
DeLeon et al., 2005) points to a significant 
clinical problem in need of solution. 

In addition to the separate context solution 
demonstrated in Experiments 2 and 3, we can 
suggest two additional remedies that differ 
from conventional Applied Behavior Analysis 
practice. First, when a pretreatment functional 
analysis identifies a socially mediated reinforc- 
er that maintains problem behavior, it can 
often be withheld in an extinction operation. 
Matching theory predicts that reductions in 
problem behavior may be achieved with low 
rates of DRA reinforcement when juxtaposed 
with extinction for target behavior (i.e., the 
low-rate DRA solution). This remedy contrasts 
with conventional practice that typically ar- 
ranges high-rate reinforcement for the proso- 
cial alternative behavior (e.g., Fisher et al., 
1993; Vollmer et al., 1999). 

A second possible solution is a variant of the 
separate context solution. Establishing a new 
alternative response often requires high-rate 
reinforcement to compete with the reinforce- 
ment history for unwanted target behavior. 
However, if high-rate DRA is introduced in a 
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separate context and then faded to low-rate 
DRA prior to introducing the DRA discrimi- 
native stimuli to the target context, this may 
avoid adding to the context-reinforcer con- 
tingencies that would contribute to persistence 
strengthening. This separate context schedule 
thinning solution is suggested because persis- 
tence-strengthening effects in humans appear 
to be sensitive to recent rates of alternative 
reinforcement. In Experiment 1, resistance to 
extinction was greater when extinction imme- 
diately followed DRA reinforcement than 
when it immediately followed baseline rein- 
forcement. Ahearn et al. (2003) reported 
similar findings in a clinical population using 
response-independent reinforcer deliveries. 
This remedy departs from the usual practice 
of DRA schedule thinning in the context in 
which target behavior has a history of rein- 
forcement. 

The three experiments reported here rep- 
resent an explicitly bidirectional approach to 
translational research. While the past three 
decades have seen growing recognition of the 
importance of linking basic and applied 
research to promote advancements in behav- 
ioral technologies and establish the broad 
generality of behavioral laws (e.g., Dietz, 
1978; Hake, 1982; Lerman, 2003; Mace, 1994; 
Mace & Wacker, 1994), most translational 
research has focused on establishing the 
generality of basic research to human affairs. 
However, coordinated basic and applied re- 
search can stimulate both sectors to pursue 
research questions and paradigms less likely to 
be followed when basic and applied research- 
ers work separately. Experiment 1 was stimu- 
lated by our collaboration with Tony Nevin on 
clinical applications of behavioral momentum. 
Through this collaboration it became appar- 
ent that Condition A in Experiment 2 of Nevin 
et al. (1990) represented a clinical application 
of DRA (see Figure 1). That high-rate rein- 
forcement of prosocial behavior could inad- 
vertently strengthen the persistence of the 
same unwanted target behavior it reduces was 
counterintuitive. Our clinical demonstration 
that DRA can increase resistance to extinction, 
in turn, stimulated further collaboration 
among basic and applied researchers to 
attempt a solution to this problem. 

The present series of experiments is not 
without limitations. First, Experiment 1 did 
not replicate the extinction following baseline 


and extinction following DRA conditions 
within participants. We chose not to replicate 
the extinction conditions so as not to subject 
the participants to additional lengthy extinc- 
tion processes. However, the principal finding 
in Experiment 1 that DRA can increase the 
persistence of target behavior was replicated in 
both participants in Experiment 3 in the 
multiple schedule design employed. Second, 
the extinction conditions in Experiments 2 
and 3 presented the discriminative stimuli 
correlated with the target response in Compo- 
nent 1 and the compound of Components 1 
and 3 twice as many times as Component 2. 
Although greater resistance to extinction was 
apparent early in the extinction process for 
both rats and humans, replications of this 
research should conduct extinction only with 
the discriminative stimuli in Component 2 and 
compound of Components 1 and 3 to elimi- 
nate this possible confound. 

The present investigation suggests several 
areas for future study that extend the bidirec- 
tional nature of this research and address some 
limitations of our studies. First, we did not 
measure the resistance of alternative behavior 
to extinction. Because strengthening the per- 
sistence of prosocial behavior is an important 
clinical goal, this should be integral to future 
research protocols. Second, the generality of 
the present findings on DRA and those using 
time-contingent schedules (Ahearn et al., 2003; 
DeLeon et al., 2005) should be examined for 
other forms of alternative reinforcement used 
in clinical practice (such as differential rein- 
forcement of other behavior, differential rein- 
forcement of high rates, and differential 
reinforcement of low rates) . Finally, the persis- 
tence-strengthening effects of alternative rein- 
forcement may be best understood within the 
contexts of the reinstatement, resurgence and 
renewal paradigms discussed above. Explicit 
linkage of this research to the clinical problem 
of failures of treatment maintenance and 
generalization may lead to novel solutions to 
an important limitation of Applied Behavior 
Analysis interventions. 
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